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RAW SEQUENCE LISTING DATE: 12/27/2002 

PATENT APPLICATION: US/09/858 , 332A TIME: 14: 16*: 41 

Input Set : D:\seqlist.txt RFOFIVFD 
Output Set : N:\CRF4\12272002\l858332A.raw rlCwL-l V L-L^ 

4 <110> APPLICANT: Tchaga, Grigory S. JAN 0.6 2003 

5 Jokhadze, George 4CAnfOGflfi 

7 <120> TITLE OF INVENTION: Metal Ion Affinity Tags and ■ Methods fo^Q^ CENTER I OUU/tyuU 

8 Using the Same 

11 <130> FILE REFERENCE: CLON-056CIP 

13 <140> CURRENT APPLICATION NUMBER: US 09/858, 332A 

14 <141> CURRENT FILING DATE: 2001-05-15 

16 <150> PRIOR APPLICATION NUMBER: 09/404,017 

17 <151> PRIOR FILING DATE: 1999-09-23 

19 <150>< PRIOR APPLICATION NUMBER: 60/101,867 

20 <151> PRIOR FILING DATE: 1998-09-25 
22 <160> NUMBER OF SEQ ID NOS : 20 

24 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

26 <210> SEQ, ID NO: 1 

27 <211> LENGTH: 16 

28 <212> TYPE: PRT 

29 <213> ORGANISM: Artificial Sequence 

31 <220> FEATURE: 

32 <223> OTHER INFORMATION: affinity peptide 

34 <4 00> SEQUENCE: 1 

35 His Leu lie His Asn Val His Lys Glu Glu His Ala His Ala His Asn 

36 1 5 10 15 

39 <210> SEQ ID NO: 2 

40 <211> LENGTH: 18 

41 <212> TYPE: PRT 

42 <213> ORGANISM: Artificial Sequence . - • 
4 4 <220> FEATURE: 

4 5 <223> OTHER INFORMATION: affinity peptide 

47 <400> SEQUENCE: 2 

48 His' Asp Asp His Asp Asp His Asp Asp His Asp Asp His Asp Asp His 

49 1 5 10 15 

50 Asp Asp 

54 <210> SEQ ID NO: 3 

55 <211> LENGTH: 16 ' ' 

56 <212> TYPE: PRT 

57 <213> ORGANISM: Artificial Sequence 

59 <220> FEATURE : 

60 <223> OTHER INFORMATION: affinity peptide 

62 <4 00> SEQUENCE: 3 

63 His Glu Glu His Glu Glu His Glu Glu His Glu Glu His Glu Glu His 

64 1 5 10 15 

67 <210> SEQ ID NO: 4 

68 <211> LENGTH: 18 
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Output Set: N:\CRF4\12272002\l858332A.raw 

69 <212> TYPE: PRT _ 

70 <213> ORGANISM: Artificial Sequence 

72 <220> FEATURE: 

73 <223> OTHER INFORMATION : affinity peptide 
75 <400> SEQUENCE: 4 ■ 

7 6 His Asp Glu His Asp Glu His Asp Glu His Asp Glu His Asp Glu His 

77 1 5 10 15 

78 Asp Glu 

82 <210> SEQ ID NO: 5 

83 <211> LENGTH: 18 

84 <212> TYPE: PRT 

85 <213> ORGANISM: Artificial Sequence 

87 <220> FEATURE: 

88 <223> OTHER INFORMATION: affinity peptide 

90 <4 00> SEQUENCE : 5 

91 His Glu Asp His Glu Asp His Glu Asp His Glu Asp His Glu Asp His 

92 1 5 10 15 

93 Glu Asp 

97 <210> SEQ ID NO: 6 

98 <211> LENGTH: 4 

99 <212> TYPE: PRT 

100 <213> ORGANISM: Artificial Sequence ~ 

102 <220> FEATURE: 

103 <223> OTHER INFORMATION: enterokinase . cleavage site 

105 <400> SEQUENCE: 6 

106 He Glu Gly Arg 

107 1 

110 <210> SEQ ID NO: 7 

111 <211> LENGTH: 6 

112 <212> TYPE: PRT 

113 <213> ORGANISM: Artificial Sequence 

115 <220> FEATURE: 

116 <223> OTHER INFORMATION: a factor Xa cleavage site 

118 <400> SEQUENCE: 7 

119 Leu Val Pro Arg Gly Ser 

120 1 5 

123 <210> SEQ ID NO: 8 

124 <211> LENGTH: 8 

125 <212> TYPE: PRT 

126 <213> ORGANISM: Artificial Sequence 

128 <220> FEATURE: 

129 <223> OTHER INFORMATION: a thrombin cleavage site 

131 <400> SEQUENCE: 8 

132 His Pro Phe His Leu Val He His 

133 1 • 5- 

136 <210>. SEQ ID NO: 9 

137 <211> LENGTH: 10 

138 <212> TYPE: PRT 

139 <213> ORGANISM: Artificial Sequence 
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PATENT APPLICATION-: US/09/858 , 332A TIME: 14:16:41 

Input Set : D:\seqlist.txt 

Output Set: N:\CRF4\12272002\l858332A.raw 

141 <220> FEATURE: 

142 <223> OTHER INFORMATION : a renin cleavage site 
14 4 <4 00> SEQUENCE: 9 

14 5 Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
146 1 5 10 

149 <210> SEQ ID NO: 10 

150 <211> LENGTH: 8 

151 <212> TYPE: PRT 

152 <213> ORGANISM: Artificial Sequence 

154 <220> FEATURE: 

155 <223> OTHER INFORMATION: an immunological tag 

157 <4 00> SEQUENCE: 10 

158 Asp Tyr Lys Asp Asp Asp Asp Lys 

159 1 . 5 

162 <210> SEQ ID NO: 11 ' 

163 <211> LENGTH: 11 

164 <212> TYPE: PRT 

165 <213> ORGANISM: Artificial Sequence 

167 <220> FEATURE: 

168 <223> OTHER INFORMATION:/ an immunological tag 

170 <400> SEQUENCE: 11 

171 Cys Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 
; 172 1 5 10 

* 175 <210> SEQ ID NO: 12 

176 <211> LENGTH: 11 

177 <212> TYPE: PRT 

178 <213> ORGANISM: Artificial Sequence . 
180 <220> FEATURE: 

-181 <223> OTHER INFORMATION: an immunological tag 

183 <4 00> SEQUENCE: 12 

184 Cys Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 

185 1 5 10 

188 <210> SEQ ID NO: 13 

189 <211> LENGTH: 3430 

190 <212> TYPE: DNA 

191 <213> ORGANISM: Artificial Sequence 

193 <220> FEATURE: 

194 <223> OTHER INFORMATION: DNA sequence of vector containing cDNA of 

195 recombinant enterokinase 

197 <400> SEQUENCE: 13 

198 gacgaaaggg cctcgtg.ata cgcctatttt tataggttaa tgtcatgata ataatggttt 60 

199 cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120 

200 tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180 

201 aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 24 0 

202 ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300 

203 ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360 

204 tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420 

205 tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 4 80 

206 actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 54 0 
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207 gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600 

208 acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660 

209 gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg "720 

210 acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780 

211 gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840 

212 ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 

213 gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960 

214 cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020 

215 agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080 

216 catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140 

217 tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 

218 cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260 

219 gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320 

220 taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380 

221 ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440 

222 tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 

223 ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560 

224 cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620 

225 agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680 

226 gbgcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct 1740 

227 ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc 1800 

228 aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt 1860 

229 ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg 1920 

230 tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga 1980 

231 gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa ccgcctctcc ccgcgcgttg 204 0 

232 gccgattcat taatgcagct ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg 2100 

233 caacgcaatt aatgtgagtt agctcactca ttaggcaccc caggctttac actttatgct 2160 
■234 tccggctcgt atgttgtgtg gaattgtgag cggataacaa tttcacacag gaaacagcta 2220 

235 tgaccatgat tacgccaagc ttgaaggatc atctcatcca caatgtccac aaagaggagc 2280 

236 acgctcatgc ccacaacaag atcgatattg tcggaggaag tgactccaga gaaggagcct 2340 

237 ggccttgggt cgttgctctg. tatttcgacg atcaacaggt ctgcggagct tctctggtga 2400 

238 gcagggattg gctggtgtcg gccgcccact gcgtgtacgg gagaaatatg gagccgtcta 24 60 

239 agtggaaagc agtgctaggc ctgcatatgg catcaaatct gacttctcct cagatagaaa 2520 

240 ctaggttgat tgaccaaatt gtcataaacc cacactacaa taaacggaga aagaacaatg 2580 

241 acattgceat gatgcatctt gaaatgaaag tgaactacac agattatata cagcctattt 2640 

242 gtttaccaga agaaaatcaa gtttttcccc caggaagaat ttgttctatt gctggctggg 2700 

243 gggcacttat atatcaaggt tctactgcag- acgtactgca agaagctgac gttccccttc 2760 

244 tatcaaatga gaaatgtcaa caacagatgc cagaatataa cattacggaa aatatggtgt 2820 

245 gtgcaggcta tgaagcagga ggggtagatt cttgtcaggg ggattcaggc ggaccactca 2880 

246 tgtgccaaga aaacaacaga tggctcctgg ctggcgtgac gtcatttgga tatcaatgtg 2940 

247 cactgcctaa tcgcccaggg gtgtatgccc gggtcccaag gttcacagag tggatacaaa 3000 

248 gttttctaca tgagctcgta attagctgag aattcactgg ccgtcgtttt acaacgtcgt 3060 
24 9 gagcctggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg 3120 

250 ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc 3180 

251 tgaatggcga atggcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac 3240 

252 accgcatatg gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagcccc 3300 

253 gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt 3360 

254 acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac 3420 

255 cgaaacgcgc 3430 
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Output Set: N:\CRF4\12272002\l858332A.raw 

257 <210> SEQ ID NO: 14 

258 <211> LENGTH: 279 

259 <212> TYPE: PRT 

260 <213> ORGANISM: Artificial Sequence 

262 <220> FEATURE: 

263 <223> OTHER INFORMATION: protein sequence of vector containing cDNA of 

264 recombinant enterokinase 

266 <400> SEQUENCE: 14 

267 Met Thr Met lie Thr Pro Ser Leu Lys Asp His Leu lie His Asn Val 

268 1 *• 5 10 15 

269 His Lys Glu Glu His Ala His Ala His Asn Lys lie Asp lie Val' Gly 

270 20 .25 30 

271 Gly Ser Asp Ser Arg Glu Gly Ala Trp Pro Trp Val Val Ala Leu Tyr 

272 35 40 n 45 

27 3 Phe Asp Asp Gin Gin Val Cys Gly Ala Ser Leu Val Ser Arg Asp Trp 

274 . 50 55 60 

275 Leu Val Ser Ala Ala His Cys Val Tyr Gly Arg Asn Met Glu Pro Ser 

276 65 70 75 80 

277 Lys Trp Lys Ala Val Leu Gly Leu His Met Ala Ser Asn Leu Thr Ser 

278 85 90 95 

279 Pro Gin lie Glu Thr Arg Leu lie Asp Gin lie Val lie Asn Pro His 

280 100 105 - 110 

281 Tyr Asn Lys Arg Arg Lys Asn Asn Asp lie Ala Met Met His Leu Glu 

282 . 115 120 ' ,125 

283 Met Lys -Val Asn Tyr Thr Asp Tyr lie Gin Pro lie Cys Leu Pro Glu 

284 130 135 140 

285 Glu Asn Gin Val Phe Pro Pro Gly Arg lie Cys Ser lie Ala Gly Trp 

286 145 150 155 160 

287 Gly Ala Leu lie Tyr Gin Gly Ser Thr Ala .Asp Val Leu Gin Glu Ala 

288 165 170 175 

289 Asp Val Pro Leu Leu Ser Asn Glu Lys Cys Gin Gin Gin Met Pro Glu 

290 a 8 0 185 190 

291 Tyr Asn lie Thr Glu Asn Met Val Cys Ala Gly Tyr Glu Ala Gly Gly 

292 195 200 - * 205 

293 Val Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Met Cys Gin Glu 

294 210 215 220 

295 Asn Asn Arg Trp Leu Leu Ala Gly Val Thr Ser Phe Gly Tyr Gin Cys 

296 225 230 235 240 

297 Ala Leu Pro Asn Arg Pro Gly Val Tyr Ala Arg Val Pro Arg Phe Thr 

298 245 250 255 

299 Glu Trp lie Gin Ser Phe Leu His Glu Leu Val lie Ser Glu Phe Thr 

300 260 • 265 270 

301 Gly Arg Arg Phe Thr Thr Ser 

302 275 

305 <210> SEQ ID NO: 15 

306 <211> LENGTH: 12 

307 <212> TYPE: PRT 

308 <213> ORGANISM: Artificial Sequence 
310 <220> FEATURE: 
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VERIFICATION SUMMARY DATE: 12/27/2002 

PATENT APPLICATION: US/09/858 , 332A TIME: 14:16:42 

Input Set : D:\seqlist.txt 

Output Set: N:\CRF4\12272002\l858332A.raw 
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